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Abstract 

As an alternative to current wired-based networks, wireless sensor networks (WSNs) are becoming 
an increasingly compelling platform for engineering structural health monitoring (SHM) due to relatively 
low-cost, easy installation, and so forth. However, there is still an unaddressed challenge; the application- 
specific dependability in terms of sensor fault detection and tolerance. The dependability is also affected 
by a reduction on the quality of monitoring when mitigating WSN constrains (e.g., limited energy, narrow 
bandwidth). We address these by designing a dependable distributed WSN framework for SHM (called 
DependSHM) and then examining its ability to cope with sensor faults and constraints. We find evidence 
that faulty sensors can corrupt results of a health event (e.g., damage) in a structural system without 
being detected. More specifically, we bring attention to an undiscovered yet interesting fact, i.e., the 
real measured signals introduced by one or more faulty sensors may cause an undamaged location to be 
identified as damaged (false positive) or a damaged location as undamaged (false negative) diagnosis. 

This can be caused by faults in sensor bonding, precision degradation, amplification gain, bias, drift, 
noise, and so forth. In DependSHM, we present a distributed automated algorithm to detect such types 
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of faults, and we offer an online signal reconstruction algorithm to recover from the wrong diagnosis. 
Through comprehensive simulations and a WSN prototype system implementation, we evaluate the 
effectiveness of DependSHM. 
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I. Introduction 

Wireless sensor networks (WSNs) consist of a number of sensor nodes that can collaborate 
with each other to perform monitoring tasks. WSNs have been widely deployed on the ground, 
vehicles, structures, and the like for enabling various applications, e.g., target detection, scientific 
observation, safety-related, and traffic monitoring [HI, 0, [|3l, IH, O, [|71, A WSN 
typically consists of a large number of resource-limited sensor nodes working in a self-organizing 
and distributed manner. Sensor nodes Applications of WSNs include military sensing, wildlife 
tracking, traffic surveillance, health care, environment monitoring. Recent work has explored 
that WSNs can be a compelling platform for engineering structural health monitoring (SHM), 
due to relatively low-cost, easy installation, and so forth [|9l, ifTOl . ifTTI . [[T2ll . [fT3l . In a typical 
SHM system, the interest is in monitoring possible changes (e.g., damage, crack, corrosion) on 
physical structures (e.g., aerospace vehicles, buildings, bridges, nuclear plants, etc.) and providing 
an “alert” at an early stage to reduce safety-risk. This prevails throughout the aerospace, civil, 
structural, or mechanical (ACSM) engineering communities. 

Both ACSM and computer science (CS) communities have already addressed numerous chal¬ 
lenges/requirements, including data acquisition, compression, aggregation, damage detection, 
distributed computing. However, there is still an unaddressed challenge: the application-specific 
dependability, which is the ability of a WSN providing application-specific meaningful monitor¬ 
ing results under sensor faults. Particularly, such a system should be able to detect the sensor data 
faults online and take recovery actions immediately to avoid meaningless monitoring operations. 
In fact, dependability is highly desired in a WSN-based SHM, as an “alert” about a structural 
event conveys a serious concern with public safety and economic losses. 

On the one hand, SHM algorithms in wired sensor networks used by ACSM are generally 
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centralized/global-based IfTTlI . IfTdll . IfTSl . [|T^ . in which they may not need to seriously eonsider 
data eolleetion quality and synehronization errors, ete. This is beeause they may not often handle 
data losses or mismateh, as there are no issues like poor wireless eonneetivity, narrow bandwidth, 
and energy eonstraints. The dependability is affeeted by a reduetion on the quality of data when 
mitigating the eonstraints. Moreover, onee data from the WSN is eolleeted at a eentralized base 
station (BS), it beeomes eomplex to serutinize all the eolleeted data (ineluding faulty signals). 

On the other hand, signifieant efforts have been made for speeifie fault types in WSNs 
uni, m- Some prominent sehemes, namely, decision fusion (or 0/1 decision), threshold-based 
deeision, heartbeat reception have been suggested for fault-tolerant phenomenon (sueh as an 
event) deteetion problems [fTTIl . lfT9l . [I^ . Il2n . [|22ll- These often use simplified data and few 
measurements to adequately deteet eertain faults. However, they are not able to funetion properly 
in an SHM system, sinee SHM algorithms use totally different methods to deteet a damage event. 
For example, the algorithms need raw measured signals rather than the deeision fusion, and the 
analysis of signals (vibration, strain, damping, ete.) that requires a substantial knowledge from 
ACSM domains (e.g., finite element model updating. Eigen matrix, mode shape properties) [|2^ . 
[fT3]l . [[Toll, lEHl- We have evidenee from experimental settings that when there is a ehange in 
struetural health properties (as shown Fig. la), 0/1 deeision sehemes tell sensor 5 is faulty, but 
they eannot tell what happen (faulty signals or damage event) around sensors 4 and 6. Regarding 
all these issues above, a question might be posed: is it possible to have a dependable SHM system 
using WSNs? 

The answer is positive. In this paper, we design a dependable and distributed WSN frame¬ 
work for SHM (ealled DependSHM) that jointly eonsiders ACSM and CS requirements. In 
DependSHM, we propose an algorithm to deteet sensor faults effieiently under the eonstraints 
of the WSN. Dependability in WSNs suffers from various types of faults, ineluding, transeeiver 
failure, link errors, seeurity attaeks (e.g., eollusion), ete [flSl . Il25l . Numerous efforts are being 
published every day in handling these fault types. Instead, we are interested in some types of 
sensor faults that are eommon but diffieult to identify: sensor debonding (when a sensor partially 
or completely debonds from the host structure), faulty signals, faults in offset, bias, precision 
degradation, and the amplification gain factor of signals, noise faults, node missing or failure. 

Most of the sensor data faults fall within these fault models and they direetly interrupt a 
WSN system from deteeting damage. Sensors with some of these faults seem to work properly. 
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Fig. 1. Investigation of the dependability performance of different schemes in structural health monitoring (SHM). 


to communicate to neighbors, to exchange heartbeats, but they return incorrect readings or 
decisions. Under any of the fault occurrences in a practical SHM, we discover a fact that goes to 
SHM system dependability: both faulty and non-faulty sensors can generate abnormal signals or 
decisions (i.e., remarkable changes in the measured signals). The difficult part is that sensor data, 
the only available information, will be affected by both structural damage and sensor faults. We 
further discover an interesting fact that such a possibility can cause an undamaged location to 
be identified as damaged (false positive) or a damaged location can be given undamaged (false 
negative) diagnosis. When we transform these false positive and negative rates into a structural 
health event detection ability as the performance of system dependability (as shown in Fig. lb), 
we find that those decision based and current SHM schemes do not perform well. 

We use a new general measurement, mutual information independence (Mil), between two 
signals u and v from two different sensors for evaluating results in the absence of the ground 
truth. We think that mutual statistical information can be used as an indicator to decide on a 
sensor fault detection in conjunction with damage detection. We attempt to reconstruct faulty 
sensor signals using Kalman filter techniques so that if there is damage, it can be recovered after 
the reconstruction. This does not require any costly actions, including sensor grouping, faulty 
sensor avoiding, masking, isolating, or replacing. 

Our major contributions in this paper are as follows: 

• We study a WSN-based SHM system dependability problem and design DependSHM to 
address the problem. This task is by no means easy, as it requires multi-domain knowledge 
and is associated with optimizing WSN resource constraints. 
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• We propose a non-faulty data eolleetion algorithm, by whieh we utilize an online faulty 
sensor deteetion algorithm based on the funetion of Mil. Although we foeus on sensor 
faulty signals in DependSHM, MIT does not rely on a partieular fault type. 

. In DependSHM, we present a reeovery algorithm to reeonstruet faulty sensor signals based 
on the Kalman filter teehnique. The reeovery is direetly applieable to any kind of spatially 
and temporally eorrelated signals that are eaused by numerous sensor faults in a WSN-based 
SHM system. 

• We evaluate DependSHM via simulations using real data sets, adopted from a SHM system 
deployed on the GNTVT strueture [|2^ . We implement a prototype system developed by 
the TinyOS ll27l running on the Imote2, and verify it on a test strueture. The results show 
that a eareful use of reeovery from faulty signals in DependSHM is effeetive and ean lead 
to a dependable WSN-based SHM system. 

This paper is organized as follows. Seetion 2 reviews related Work. Seetion 3 provides system 
models and problem formulation. Seetion 4 presents the DependSHM framework. The faulty 
sensor deteetion algorithm is in Seetion 5. Faulty sensor signal reeonstruetion is detailed in 
Seetion 6. Performanee evaluation is outlined in Seetion 7. Seetion 8 eoneludes this paper. 

H. Related Work 

Dependability in WSN-based SHM. WSNs have been widely suggested and validated in 
experimentation for SHM system by both the ACSM and CS eommunities in reeent years [fTOll . 
[fm . [|T3]| . [[T4l . [fTSll . [l24l . [|9l , [l28l . Existing sehemes already have suffieient eontributions to 
ACSM and CS requirements [El, m, GDI, lEl, Ca, [El, El, El, El, El, El, El, El, 
El, [l3a . but they suffer from the dependability problem. 

On the one hand, generally data ean be eorrupted at four stages, namely aequisition, proeessing 
and loeal deeisions, wireless transmission, and the final analysis at the BS. Among them, the 
most important stage is the aequisition stage that ean ensure the quality of sensor readings in 
WSNs at the beginning. The quality is also affeeted when applieation-speeifie requirements are 
eonsidered, ineluding high-resolution data, raw data, non-faulty data, dependable and real-time 
deeision-making to analyze aetual struetural health eonditions. These additional requirements are 
traditionally guaranteed by using wired networks. To make WSNs effeetive alternatives to wired 
network system instruments, a first step in this direetion is SHM system dependability in terms 
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of detection of faulty sensor readings and a collection of non-faulty readings at the BS, and then 
dependable monitoring results. In this paper, we take such a step. 

Work from Generic WSN Applications Related to SHM Dependability. Fault tolerance in 
WSNs has been studied extensively by researchers in computer science IfTTl . [|T^ . If20ll . lfT3ll . [|33l|, 
lf20l . ll22Tl . The application background is largely event/target detection in battlefield surveillance, 
environment monitoring, etc. The general objective is that when some sensor nodes give faulty 
readings, how to achieve the correct detection of an event/target over a specific region. Among 
them, a large part of the schemes on fault detection are off-line and centralized-based. Most 
schemes rely on various detection methods, including correlation analysis, 0/1 decision, value 
fusion, decision rules or threshold. Some more details can be found in our earlier work [|20ll . 

@ 1 , m, ( 21 . 

Although dependability support by fault detection and tolerance problem in SHM looks similar 
to the problem of making binary “0/1” detection decisions or value fusions [Q, (2011, (2TI . it is 
fundamentally different from them. To check the validity of this assumption and the dependability, 
we have conducted WSN-based SHM experiments on a physical structure (the settings are 
described in the later part). For these experiments, faulty signals are injected into sensor nodes 
to validate the systems capabilities to autonomously detect and tolerate the fault. As shown in 
Fig. 1, these schemes do not show a satisfactory performance in SHM; specifically, they cannot 
identify what exactly occurs in a WSN-based SHM. We can see that when there is a change 
in structural health properties (as shown Fig. la, a 0/1 decision scheme ll2Tll tells sensor 5 is 
faulty, but it cannot tell what happens (faulty signals or damage event) around sensors 4 and 6. 
There are also changes in signals of sensors 5 and 6. Those existing schemes show here a high 
rate of fault positive and false negative rates, resulting in a low system dependability. When we 
consider WSN-based system dependability as the structural health detection ability, we can see 
that these schemes show low dependability performance, as shown in Fig. lb. The methods of 
detecting faulty sensors by measured signals, removing faulty signals from the measured signals, 
and then identifying what happens exactly in a structure are different from the methods in those 
schemes. 

Numerous techniques towards the area of fault detection and isolation (FDI) have been 
proposed, e.g., model-based techniques, knowledge-based techniques, or a combination of both 
(341 . There are also techniques on fault-tolerant data aggregation that deal with faulty sensor 
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readings caused by security attacks, such as node compromising, collusion [TTSlI . [1251 . They use 
some filtering algorithms (e.g., iterative filtering) at the upper-stream nodes (e.g., cluster head) 
to remove the faulty signals. Though the algorithms seem to be applicable for our case, we could 
not justify them. However, it may be difficult to apply such filtering algorithms at a upper-stream 
node once such a high-resolution big acceleration data from a number of nodes reaches at a 
upper-stream node and the upper-stream node filters all the raw data. Various constraints in 
WSNs and application requirements might be an issue in them. 

Work from SHM Applications Related to SHM Dependability. On the contrary, there 
are also fault detection schemes from ACSM engineering domains [f35l . (SHI, llTTl . [l3^ . The 
concepts in most of them are associated with system failure detection dating back to the 1980s. 
Here, the failure does not imply to faults in a WSN system, but to faults (e.g., damage) in 
a physical systems. FDI concepts (described previously) have also been implemented in a 
number of engineering disciplines, such as ACSM, to improve the availability and reliability of 
SHM systems If^ . However, these are centralized and computationally-intensive, and usually 
developed in wired networks. 

A noteworthy WSN deployment method for SHM applications called SPEM [|T^ . SPEM 
nicely explains the ACSM and CS requirements and is verified on the real structure. It adjusts 
the quality of sensor locations to better fit WSN requirements; meanwhile, the adjustment satisfy 
ACSM location-quality requirements. We have verified SPEM under sensor faults in simulations 
and found that the SHM dependability performance in SPEM drastically decreases from 87% to 
28% as the number of sensors in the WSN increases, as shown in Eig. lb. This is just because 
of a lack of the dependability support. 

To the best of our knowledge, as the first step, we have worked towards the WSN-based SHM 
application-specific dependability, and have got preliminary results flUl, [13^ : and this paper is an 
extension. Our first work is about sensor fault detection algorithm and structural damage event 
in WSN-based SHM lf3^ that works on Natural Erequency extraction and Matching in Clusters 
(NEMC for short), and then tackles faulty sensor readings. However, it has several shortcomings, 
described in @. After improving the shortcomings, we check the dependability performance by 
NEMC, as shown in Eig. lb. Its dependability performance falls between 96% and 76%, which 
is much better than all other schemes. However, such a performance is still not enough to put 
our confidence in a WSN-based SHM system. 


As an extension, this paper ineludes several aspeets. (i) We deal with the problem of SHM 
system dependability and design DependSHM for the problem, (ii) We propose a WSN frame¬ 
work to observe the dependability by removing faulty sensor data from struetural damage data 
and by finding a fault indicator based on Mil. To ensure the dependability, we devise a new 
method for toleranee to a missing or failed sensor. We then address the ehallenge of when both 
structural damage and sensor fault oeeur at the same time, and identify what exaetly happens 
in a strueture. Generally, it is not easy to make sure that there is a faulty sensor but there 
is no damage. Partieularly, we attempt to make DependSHM effieient for reeovery from the 
sensor faults that oeeur for a short duration; thus, masking or isolating the sensor (as required 
in NFMC) is not needed, (iii) For the faulty sensor, we present a proeess flow of the Kalman 
filter and a graphieal representation of it for the signal reeonstruetion. (iv) The motivation for the 
dependability, elarifieations on system models, and SHM-speeifie terms are given, (v) A detailed 
performanee evaluation and new results are presented. 

III. Models and Problem Formulation 

In this seetion, we provide some neeessary models and definitions. Then, we formally formu¬ 
late our problem. 

A. Network Model 

We assume that a set P of m wireless sensors is in eharge of performing different types of 
applieation tasks (e.g., sensing the vibration, strain, and damping signals, pressure, temperature, 
ete., in the eontext of SHM) and sending its measurements to neighboring nodes. A referenee 2D 
building model is shown in Fig. 2a, where sensors (white eirele) are deployed on it and a remote 
monitoring eenter or BS station loeation (eolored eirele) is at a remote plaee. Fig. 2b shows the 
traditional WSN framework for SHM (whieh is similar to the framework in dUl), while Fig. 2e 
shows the proposed distributed, dependable WSN-based SHM framework, DependSHM. 

Consider that the set of sensors is deployed on a strueture by finding loeations from a set 
of eandidate loeations of the strueture; L = {lo^h^h-,-" i^m}, where sensor i is plaeed at 
loeation Zj, and Iq is a suitable loeation of the BS. For high-quality monitoring results, we follow 
engineering-driven sensor deployment method [IT3ll . [HHl, [[^ . [l40ll . Regarding DependSHM in 
Fig. 2e, sensor i ean be allowed to aequire data, analyze it loeally (prepare natural frequeney 
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Fig. 2. WSN-based SFIM frameworks. 

if needed), identify faulty readings, and finally eompute mode shapes loeally or transmit the 
non-faulty raw data to the BS (see Appendix B for the natural frequeney and mode shape 
definitions). 

Let Rmax and Rmin be the maximum and minimum eommunieation ranges of a sensor, 
respeetively. Rmin is used to maintain loeal topology, where a number of sensors is allowed to 
share their signals with their neighbors for damage deteetion, also used for fault deteetion. The 
intention of adopting adjustable eommunieation range is to reduee energy eost for transmission. 
Note that Imote2 sensor platform supports diserete power levels iHTI . Two loeal topologies ean 
be seen in Fig. 3; eaeh sensor ean be overlapped by one or more sensors. When a sensor 
eommunieates to the BS direetly, Rmax is used. Eaeh sensor eorresponds to a vertex in a 
network graph denoted by G, and two vertiees are eonneeted in G if their eorresponding sensors 
eommunieate direetly. The graph G is ealled the eommunieation graph of this WSN. 

B. Sensor Faults 

1) Fault Model: We foeus on the following set of sensor faults that oeeur in a real WSN-based 
SHM system: 

• Sensor debonding—it is a very eommon fault in a WSN-based SHM that oeeurs when a 
wireless sensor slightly or eompletely debonds/detaehes from the host strueture. This affeet 






















10 


is seen in terms of accurate vibration capturing from the structural response. 

• Faulty signals—these are caused by precision degradation, breakage, etc., especially in 
vibration signal capturing. For example, a sensor reports a constant value for a large number 
of successive samples, where the constant value is either very high or very low compared 
to the “normal” or “reference” value. 

• There are faults in offset, bias, and the amplification gain factor of signals. For example, 
the offset fault is due to calibration errors in sensor signals, which differs from the normal 
value by a constant amount, but the sensor readings still exhibit normal patterns. 

• Noise faults are caused by longer duration noisy readings that affect a number of successive 
samples. 

• There is also node missing or failure. 

Sensors with these faults seem to work properly (except for the last type), to communicate to 
neighbors and exchange messages, but they return incorrect values or decisions. We tackle these 
faults in DependSHM. 

2) Fault Detection Model: We assume that sensor i exchanges its signals with its neighbors 
in each sampling instant t in T^, where Td is the monitoring round, i.e., the time is divided into 
discrete sampling periods. In each period, i broadcasts its current readings to one-hop neighboring 
nodes using Rmin- Besides the readings, i may be enabled to make a decision locally or recover 
the mode shape and forwarded it to the BS. The signal y\ measured by a faulty sensor at t is 
subject to noise effect a. Then, let y be the measured output reading that would be transmitted 
to the neighboring nodes, which is given as follows: 

y = y\ + (^ ( 1 ) 

The measurement noise, a, for non-faulty sensors may be a small random noise in practice. 
However, it also can greatly affect damage event detection. Consider that a subset N of sensors 
is non-faulty at time t. In SHM, when all of the sensors are non-faulty, it is easily possible 
to estimate the mode shape from the signals. However, if a sensor is faulty, it is possible to 
produce predicted signals for the mode shape by using neighbors’ signals, correlation statistics, 
and the extent of a. Suppose that damage may occur at time t, anywhere in the structure. A 
subset D c P of sensors around the damage area is possibly able to detect the damage. Some 
of the sensors from D may provide faulty signals. Thus, to detect faulty sensors, the sensors in 
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D split into two further subsets: 

N= sensors assumed to be non-faulty 
F = sensors assumed to be faulty 

Note that these two sets are disjoint so that 


n F = {} and N U F = D, where D G P 

Generally, sensors anywhere in the WSN ean be faulty/failed. However, we put an emphasis 
on eontinuous monitoring and on those sensors whose signal have ehanged signifieantly (due to 
a fault/damage). 

Definition 1 [MIL Mutual Information Independenee]. A function denoted by a;() is defined 
by the quantify of how much the measurement correlation between sensor nodes in N and sensor 
nodes in F deviate from the correlation model. 

We state the Mil funetion as an indireet vibration signal measurement. Assume that a prior 
eorrelation model C of xf presents ll4^ . C ean be given as a referenee set by all immediately- 
stored data in the sensor loeal memory after WSN system initialization. The Mil funetion between 
two signals of sensors i and j at time / in F, is given as follows. 


^{VhVj^C) (2) 

Consider that the sensors in N and F eapture vibration signals and broadeast their measured 
signal sets yj^ and yp, respeetively. Thus, Mil between the two sets of signals of N and F is 
given as follows: 

uj{yN,yF,C) (3) 

Given R eonseeutive signals, the Mil funetion estimates the eorrelation between ypf and at 
time t as: 

R R 

A{N, P) = u}{yN, yN, C) - ^u:{yF, yN, C) (4) 

t=i t=i 

A{N,F) is aehieved by redueing the deviation between non-faulty sensors in N, and by max¬ 
imizing the deviation between non-faulty sensors in N and faulty sensors in F. D ean be 
eontrolled by the system user eonsidering Rmin, neighborhood size, or network density. 
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(a) 2 overlapping sensors 
and 3 neighbors 


(b) 2 overlapping sensors 
and 4 neighbors 


Fig. 3. Topologies with different numbers of overlapping sensors in different neighborhoods. 

C. Energy Cost Model (cost(ei)) 

One important objective is to minimize the energy cost of the network. Let cost(ej) denote the 
total energy cost of sensor i, including measurement, computation, transmission, and overhead. 
Consider a shortest path routing model [IT3l . ifT^ : there is a path from sensor i to neighboring 
sensor node or the BS j: q = Zq, Zi... Zk- Sensor i propagates the data to them. We can find the 
ith hop sensor on each path and calculate the amount of traffic that passes along on the paths 
within each round of monitoring data collection (T^, d=l,2,..., n). Then, the cost{ei) can be 
decomposed into the following four parts: 


cost(ej) 


(5) 



Here, i) is the maximum energy cost for data transmission over a link between a transmitter 
and a receiver, ii) Ccomp is the energy cost for processing data locally, e.g., computing equation 
(6). iii) Csamp is the energy required for a sampling cost of M data points lf33l . Il43l . iv) Coh is 
the additional overhead for some causes, e.g., fault detection, signal reconstruction, copying data 
to a local buffer, and network latency. See Appendix A for an extended version of the energy 
cost model. 

D. Problem Statement 

Given: a set P of m sensors and a BS, which are deployed over a physical structure and 
are involved in monitoring structural health that is reported to the BS. 

Find: a subset D (C P) of sensors that involves in detecting structural damage event such 
that the sensors in D) are non-faulty and the sensors m F = D — N are faulty, subjected 
to the following constraints: 

• Data delivery: \/q = zq ... Zk used for data delivery, where q\j — l]q[j] < Rmax, j = f ■ ■ - k; 

• Connectivity: Vz = 1... m, 3q = zq ... Zk, g[0] = U, q[k] = lo\ 
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• Structural modeling. 

Objectives: minimize A{N, F) and minimize cost{ei}, and maximize dependable mode shape 
eomputation. 


IV. dependshm Framework 

In this seetion, we present DependSHM, the dependable WSN-based SHM framework. It 
is divided into three stages: distributed framework for struetural health event detention (e.g., 
damage, eraek, eorrosion), faulty sensor detention, and faulty reading reeonstruetion. 

A. Basics of Structural Event Detection Algorithm 

The eentral foeus of SHM is the detention and loealization of events (eonsidering damage) 
within various types of struetures. Generally speaking, SHM teehniques rely on measuring 
struetural responses to ambient vibrations or foreed exeitation. Ambient vibrations ean be eaused 
by earthquakes, wind, passing vehieles, or foreed vibrations, or ean be delivered by hydraulie 
or piezoeleetrie shakers. It ean also eaused by a damage oeeurrenee. 

A variety of sensors, e.g., aeeelerometers, strain gauges, or displaeement ean be used to 
measure struetural responses. SHM teehniques infer the existenee and loeation of damage by 
deteeting differenees in loeal or global struetural responses before/after a damage oeeurs. The 
responses are usually eomprised of frequeneies in the tens of Hz, and ean be sensed using rela¬ 
tively inexpensive low-noise MEMS-based aeeelerometers. Inereasingly, the ACSM eommunities 
are beeoming interested in aetive sensing teehniques [fTOll . whieh measure struetural responses 
to foreed exeitations. In order to identify the damage, two neeessary struetural eharaeteristies 
are important: mode shape ($) and natural frequeney (/). 

B. Mode Shape Computation at Each Sensor 

Eaeh type of strueture (aireraft, building, bridge, ete.) has a tendeney to vibrate with mueh 
larger amplitude at some frequeneies than others. / and $ rely on struetural material properties, 
geometry, and assembly of its eonstituent members. We use state space model, whieh is widely 
aeeepted by ACSM eommunities for eapturing struetural dynamies to eompute <I) ll2^ . lUOll . 
We mention the proeess we use for $ eomputation, whieh is the same proeess is used for 
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designing faulty signal reconstruction (after fault detection). The state space matrices for a finite¬ 
dimensional linear structural dynamic system can be succinctly obtained by the linear differential 
equation: 


Mx + Kx + a = F{x,t) (6) 

Here, function F{x,t) is the response of the structure over a period of interest at certain 
sensor locations, where x is the structural response at time instant t. M and K are the matrices 
of mass and stiffness coefficients of the various elements of the structure, respectively^, a is the 
signal to noise. In (6), damping is neglected for an advantage in detection (see Appendix C for 
details). 

In traditional SHM algorithms, the state space model is computed in a centralized/global 
fashion. We argue that it could be quite costly for the resource-limited WSN. Considering SHM 
as a big data application, to make use of the WSN for SHM, we mitigate this problem by 
allowing each sensor to work only with local structural responses rather than the global. For this 
purpose. We modify the model, considering the implementation of the model for each sensor 
location. To reduce the system order, a transformation of the state space into mode coordinates 
is necessary. This transformation is derived by determining a diagonal matrix, which contains 
a certain number of Eigen frequencies covering the natural frequency ranges of interest. By 
applying the mode transformation, based on the mass normalization {0^}, the refined $ is given 
by: 

X = (7) 

where h is the mode participation factor. We have, 

hi + 6ihi = Fli ( 8 ) 

(7) implies the refined response of the structure that is a sum of the responses in each mode. 
We can express it more explicitly as follows: 

n 

x = ^ hi(t)i (9) 

i=l 

where 5* is the fth eigenvalue, and Fli = (pjf is the fth mode of responses under force or 
ambient excitation input. The summation is given over all of the n modes of the structure that 
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Fig. 4. Providing the dependability in the WSN Framework: the step by step process of sensor fault tolerance and damage 
detection. 


are measured by the individual sensor. Typically, only the lower modes are important because 
the force excitation is concentrated in these modes. Each sensor extracts its local $ that can be 
used for both faulty signals and damage event detection. The method of extraction, including 
the difference between / and $ can be found in Appendix B. 

Is it Possible to Compute $ under Sensor Faults: In practice, $ is greatly affected by a faulty 
sensor signals (see Appendix B for more detail), especially when a sensor is placed at a optimal 
location [fT3]l . ifT^ . If a signal is detected as faulty, the measured signal is reconstructed directly 
for the actual mode shape, by collecting the signals from the sensor location or reference signals. 
We use neighboring sensor nodes’ signals for detecting a faulty sensor and reconstruct its signal. 
In this work, not all the sensors’ signals using (6) will be measured. The measured output of a 
sensor i at time t, yj, can be obtained by: 

y\ = Qx ( 10 ) 

where Q is the measurement matrix. 

C. General Overview of DependSHM 

Fig. 4 summarizes the whole WSN framework for SHM. Once the WSN starts operating, 
in each monitoring round (T^), a number of signals is measured at each sensor. Based on the 
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measured signals, each sensor identifies faulty sensors by using MIL If any faulty sensor is 
detected, the neighboring sensors reconstruct the signals, while a faulty sensor itself can do the 
same task if it still works. It is highly possible that a sensor exhibits faulty behavior temporarily 
in many cases. However, if a sensor fails or it is missing from its location, we still guarantee 
the signal reconstruction for the sensor. We do not assume to isolate or deploy a new sensor as 
it is a costly task doing so. 

In addition, for the high-quality SHM, we must provide monitoring information of each sensor 
location, since ACSM communities mostly deploy sensors at optimal locations lfT3ll . ifT^ . Each 
sensor locally computes the final <I) and transmits to the BS directly, which is a relatively small 
amount of data. The BS assembles all the received final <I)s and identifies the structural damage. 
There is a high possibility that the BS does not receive any of $ caused by data packet-loss 
or others. If sensor forwarding fails, the BS still has the final results received through the 
neighboring sensor nodes. In addition, we allow each sensor to keep the final results in the local 
memory until a sensor receives an acknowledgment from the BS. Each set of raw <I) is of a 
number of KBs while each final result or <I> computed by a sensor is a number of bytes. In this 
framework, each final <I) received from the neighbors is not processed. 

V. Eaulty Sensor Detection 

This section describes non-faulty sensor data collection and faulty sensor detection algorithms, 
according to the model described earlier. 

A. Data Collection and Faulty Sensor Indication 

We assume that sensors are likely to generate abnormal signals. The signals are measured by 
the vibration, which may be incorrect compared to the neighbors, previous signals, or reference 
signals. We first show data collection at every sensor in the WSN. A subset of sensors, say D 
of sensors that are in a sensor’s Rmin, share their data with each other, and participate in faulty 
sensor detection. 

Algorithm 1 simply presents the data collection method based on the neighborhood. While 
theoretically this procedure involves multi-hop communication, consider the fact that for SHM 
application, the radio communication range of current sensor nodes exceeds the area in which 
sensors gather signals. We limit sensors to communicate within the one-hop neighboring nodes. 
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Algorithm 1: Non-faulty Data Collection for Damage Detection 

Input: a neighborhood with a bounded degree, 
t <— transmission at a time slot 
Stepl: for j = 1 to m do: 

node i acquires data and buffers it 
Step2: i transmits its data to its neighboring node j 
for 7 = 1 to n do: 
for each node i do: 

If i has data not forwarded to j then 
transmits a new data to its j in t 
Step3: call Algorithm 2 

Step4: data aggregation (extracting frequency sets, compute <1?^) 
compute final 

StepS: transmit the final <1? toward the BS 
return health status of every sensor location 


We think that multi-hop communication is not mature enough. The nodes that are one-hop 
away from the BS will directly send the data; otherwise, the data is sent through one or 
more intermediate nodes. In Stepl of Algorithm 1, every sensor acquires signals captured from 
vibration responses of the structure, and buffer them temporarily. Then, it transmits and receives 
the measured signals. The sensors check if there are any faulty sensors, i.e., a sensor with faulty 
signals via Step2. 

StepS executes Algorithm 2. When a remarkable change appears in a sensor’s signals, there is a 
possibility that a sensor is faulty. The Mil is used to detect faults. Let us consider the statistical 
dependency between the two sensors’ signals quantified by MIL oj measures the information 
about one sensor that is shared by another sensor in the set of signals in D. It is seen that u 
changes as soon as a sensor fault occurs, because the faulty signal is not present in the reference 
or other sensor signals. 

We use a joint Gaussian distribution based correlation model. Multivariate Gaussian distri¬ 
bution has been used to accurately model the correlation of many types of signals in literature 
lf22l . Each signal is broadcast to sensors in D, where Ah sensor signal yj G and jth sensor 
signal yj G y\), i,jeD and D C P. For simplicity, yj as u and yj as v are denoted hereafter. 

Hence, it would be worth considering how to find joint probability density between two signals. 
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The statistical dependency/independency between the two Gaussian distributed time signals u 
and V can be expressed in the form of the joint probability density p{u, v) of signals: 


p{u,v) = 


‘i'KTu'Tv \/l — Pu 
2 


=e — 


2{1-pL) 


/ u — flu \ _ 2^p I / v — fj-v ^ 

\ '^u J '^u'^v \ '^v J 


( 11 ) 


where pu, and are the means and the standard deviations of the signals u and v, 

respectively, puv is the correlation coefficient between the two signals. It is given by: 

E {{u - Px){v - Py)} 


Puv — 




( 12 ) 


The correlation coefficient can also sometimes be used to determine if two signals are statistically 
independent. On one hand, if \puv\ = 1, there is a strong correlation between the two signals. On 
the other hand, if \puv\ = 0, the two signals are not correlated. The correlation can be interpreted 
as a weak form of statistical dependency. In EOl . it is shown that two random variables, which 
are not correlated, can even so be statistically dependent. This is why we take the statistical 
dependency or independency. The product of the marginal densities pu and p^ of the signals u 
and V, respectively, is given by: 

p{u,v) = p{u)p{v) (13) 


If the expression in (II) is equal to the product of the marginal densities in (13), the signals 
are completely independent. One possibility to quantify the statistical dependency between two 
signals is to calculate the Mil of them, as follows: 


u{u, V, C) = 





(14) 


p{u)p{v) 

The base of the logarithm determines the units in which information is measured. (14) shows 
that if u and V are independent, oj becomes zero. A forward approach is to divide the range of u 
and V into finite bins and to count the number of sampled pairs of ho = (mq, Vo).,o = 1, 2, • • • , n, 
falling into these finite bins. This count allows for approximately determining the probabilities, 
replacing (15) by the finite sum: 


0 Jun{u,v,C) = y'p„^(a,6) log 


Pu,v{a,h) 

ip) 


(15) 


where p«(a) n„(a)/n and p„(6) ~ nu{b)/n are the probabilities based on the number of 

points n„(a) and n„(6) falling into the ath bin of u and the 6th bin of v, respectively. The joint 
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probability is b) ^ n{a, h)/n based on the number n(a, h) of points falling into box nos. 

a, b. Mil is non-negative and symmetric: 

u{u,v,C) = u{v,u,C) > 0 (16) 

The Mil for all possible combinations of sensor outputs yr and yg (except r = s, i = 1, 2, • • • , r, j = 
1, 2, • • • , s) is computed, which leads to an tu-matrix for all combinations of r and s. The basic 
idea is that the Mil changes when a sensor fault fr is present. Suppose that it is in the rth 
channel or index: 

yr = yr + fr (IV) 


This fault appears only in the rth channel. Thus, we should expect that all combinations with 
index r should show a reduction of u. This allows us to localize the faulty sensor. One or more 
faulty sensors can be simultaneously detected in the same way. One possibility to visualize the 
faulty sensor is to use the relative change as a sensor fault indicator 

\^yr ~ ^ref\ 


K = 


UJ, 


( 18 ) 


yr 


where yr is an actual data set and the lower index ref is one reference data set. The method 
based on Mil is able to detect sensor faults in different combinations of them. 


B. Algorithm 2: Faulty Sensor Detection 

Under centralized detection, the BS handles the damage and faulty sensor detection process. 
In each decision cycle, the BS makes a decision about the faulty sensors, solely based on the 
k most recent signals received from each sensor. The BS computes the Mil for each signal, 
and chooses the signal with the maximal independence for fault detection. This detection is not 
suitable for resource-constrained WSNs. For example, if each sensor needs to send all its signals 
to the BS (where each sequence of signals or raw natural frequencies can be from XOkb to 
XOOOkb, X = 1,2,...), the centralized WSN may not be able to operate for a given period of 
time. In a large-scale WSN deployment, the situation becomes serious. After capturing data at 
high frequency in SHM, sensors should reduce data before transmission. 

In contrast, the faulty sensor detection (see Algorithm 2) can execute in a distributed manner 
where each sensor makes a decision on the collected signals locally, as described earlier. In 
the algorithm, if the local decision on a sensor’s signals, Xy^ > 0.5, the sensor is faulty. This 
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Algorithm 2: Distributed Faulty Sensor Detection 

Decision: < 0.5: non-faulty), (A“^ > 0.5: faulty) 

for each sensor i £ N where N D do: 

//set initial decision as a uniform distribution 
{Xnr)i — 0 // each sensor i is non-faulty 

loop 

for each sensor i do: 

{X‘:;ljj(neighbor) < — receive neighbor yth decision 
samples <— N samples from {Xnr)3 (neighbor) 
for each sample h do: // h is an index 
Rh <— non-faulty sensors 
Fh <— faulty sensors 
fh ^ {Rh, Fh) //Eq. 3 
for each sensor i do: 
if > 0.5 then 

i marks/reports itself as a faulty sensor node 
if i does not transmit the decision then 

j marks/reports about i as a faulty sensor node 
call Algorithm 3 // ith sensor signal reconstruction 

end 


means that Mil is high on the sensor’s measured signals. The distributed method only requires 
neighbors to be synehronized. In addition, the deteetion is almost immediate and online, since 
a sensor does not need to wait for the signals from sensor nodes at more than one hop away. 
Moreover, the detected faulty signal set is not forwarded toward the BS; thus, the communication 
cost is relatively low. The energy cost becomes lower. 

Mil does not rely on particular fault types. The algorithm 2 based on Mil is able to detect 
different kinds of faults (as modeled before). However, it may fail to detect a node missing or 
failing. We provide Appendix E for handling the node failure or node missing. 

VI. Faulty Signal Reconstruction 

In this section, we propose a Kalman Filter technique (KF) for faulty wireless sensor’s signal 
reconstruction. The KF has received extensive attention to describe the recursive solutions of 
predicting state variables for linear systems MM . We consider it, as it can generate the best 
estimation if the optimal filter is linear among all the linear observers, because it minimizes the 
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Fig. 5. Sensor signal process flow under the KF. 

error covariance. In ACMS engineering domains, the KF has been studied for on-line damage 
detection ||29l . Here, we utilize it for faulty sensor signal detection. 

A. Kalman Filter in State Space Representation 

The description of KF is made with the help of the state-space representation of the structural 
system, as described in Section 4.2. One radical concept of the KF is that the state estimation 
is recursively corrected by the actual physical system outputs. Then, using (6), the equation of 
motion for time discrete and time invariant cases are given as follows: 

h = Mt-iZt-i -f Kt-iUt -f (Tt-i 
mt = MtZt -f KtUt + at 

Ut is the excitation at a specific frequency at time t. Mt and Kt are transition matrices. The signals 
at-i and at represent the measurement noises, respectively (refer to Fig. D in Appendix D for the 
state-space equation-based KF). When measuring the responses of a dynamical structural system 
by sensors, the actual signals produced by the sensors are contaminated by noise due to internal 
manufacturing defects, physical interference, or external environmental effects. According to 
features of the KF, we assume that every measurement from the wireless sensors contains noise; 
thus, if the noise measurement is zero, the KF collapses. Setting the mean of noise as zero is 
a common practice: E[at\ = 0. Noises are assumed to be independent of each other, and are 
normally distributed with covariance matrices, = [aa'^]. 

The underlying KF information is that KF is a recursive algorithm consisting of a loop (see 
Fig. D ) which is passed through for each time instant t. The estimation of the system state for 
t is determined from the weighted average of the actual measured value at time instant t and 
the prediction of the system states for this time instant. The weight factors of this average are 
determined from estimated uncertainties in each loop, which are also connected to the predicted 
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system state and to the new measured value. The lower the uneertainty, the higher is the weight 
faetor; i.e., Kalman gain (Kt). The uneertainty is ealeulated with the help of eovarianee matriees 

m- 

We shortly deseribe the faulty signal reeonstruetion proeess. At first a priori state estimate Pk 
for the state veetor of the system are estimated [l45l : 

= Mt-iZt-i + Kt-iUt 

t ^ 20 ) 

Pk = + Cy 

After getting the measured value , a posteriori state ean be estimated in the eorreetion step, see 
(21). For the posteriori estimation, the differenee between the measured and estimated signals 
are weighted by the Kalman gain faetor Kk- 

ir‘ = r“ + Urn, - M,lf° - K,u,] = !r“ + K,lm, - mfl (21) 

Kt = PtMj[MtPtMj + c.]-* (22) 

The priori estimated error eovarianee Pk in the predietion step is used to update the Kalman 
gain faetor in (21) whereas Pk itself is updated by the a posteriori estimated error eovarianee. 
The KF is used here based on the Matlab implementation, whieh delivers the optimum Kalman 
gain (Kk) together with the steady state error eovarianee matrix (Pk). 

For the system state estimation using the KF, the proeess and the measurement noise eovarianee 
are determined. Sinee the system input is assumed to be unknown and the model uneertainty is 
high, the proeess noise eovariates are set to high values. The values for the measurement noise 
eovarianee matrix will be determined a priori by the first estimation of the measurement error 
rrit — as given in (21). The overall proeess flow of the KF is illustrated in Fig. 5. 

B. Sensor Signal Reconstruction Algorithm 

If there is a sensor deteeted as faulty using Algorithm 2, the sensor signal reeonstruetion 
algorithm (see Algorithm 3) is used by the neighboring sensor nodes. The basie idea of the 
signal reeonstruetion is as follows. When a sensor signal does not eorrespond to the modeled 
system (monitoring its loeation) and it was erroneously assumed that this signal has a low 
measurements noise eovarianee, then the signals from the other sensors eannot be eorreetly 
reeonstrueted, and the differenee between the measured and estimated signals will be high. If 
the value of the eovarianee for the faulty sensor signals is set as high, then the KF will reeonstruet 
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Algorithm 3: Sensor Signal Reconstruction 

Step 1: Detect the faulty sensor by changes 
of indicator (Algorithm 2) 

Step 2: Recreate the state space model of the structure 
w.r.t. the sensor location (including faulty or 
nonexistent sensor) on the structure 
[see M and K matrices in Eq. (6)] 

Step 3: Determine the process and measurements noise 
covariance matrices //(see Eq. (20)) 

Step 4: Set the covariance value(s) of the assumed 
faulty sensor(s) to be high 

Step 4: Solve the state space equation of motion through 
KF with the estimated state vector Zf (in signal 
correction step). The KF is driven by the input 
ut = 0 and the measured signals yl 


all the signals, including the incorrect signals, with the help of the other signals and the model. 

In this manner, it is possible to reconstruct more than one signal simultaneously. The number 
of signals that can be reconstructed rely on the number of neighboring nodes in case of the 
distributed system, all of the nodes in the network in case of the centralized system, and on the 
quality of the model. For a better understanding of Algorithm 3, the procedure is broken into 
several steps. 

By means of Algorithm 3, when just one sensor node does not work properly, it is possible to 
identify and reconstruct it only with the help of KF. For this purpose. Steps 2 to 5 are applied. 
Here, Steps 3 to 5 have to be calculated several times. The number of loops over these steps 
corresponds to the number of neighboring nodes in the case of the distributed WSN or all of 
the nodes in the case of the centralized WSN. In each loop, the measurement variance of one 
sensor is set to be high. In addition, we attempt to detect a missing sensor and construct its 
signals that can be found in Appendix E. 

VII. Performance Evaluation 

A. Simulation Studies 

1) Methodology: We conduct comprehensive simulations using MATEAB to evaluate DependSHM 
that includes the faulty sensor detection methods and signal reconstruction algorithm. We use 
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Fig. 6. Performance of different fault detection methods; achieved Mil under sensor faults. 

real data sets eollected by the SHM system employed on the high-rise GNTVT ll2^ and a SHM 
toolsuite Il46ll . We use the data sets for the 100-sensor ease in our simulations. We perform the 
WSN deployment via our WSN-based deployment seheme suggested in [|T3l . which is supported 
by the ACSM engineering deployment methods ifT^ . The simulation environment is a 450 x 50 
sensing field regarding structural environment, e.g., bridge, building, aircraft. 

The background data is simulated as vibration influenced by the 100 sensor locations in the 
field. A random Gaussian noise is added to all the data. The mean of the noises is zero, and 
the standard deviation is 10% of the real signals. From the data sets, a set of data is used as 
reference data to train the joint distribution, and another set of similar data is used for testing. A 
random Gaussian noise is added to all the data. The mean of the noises is zero, and the standard 
deviation is 10% of the real signals. From the data sets, a set of data is used as reference data to 
train the joint distribution, and another set of similar data is used for testing. The noise is present 
in both the data sets. Thus, the trained correlation model reflects the noises. In the distributed 
detection method, each sensor makes a decision based on signals received from neighboring 
nodes within Rmin- After a sensor receives a decision, it recomputes its Mil and chooses to 
change its decision accordingly. The energy cost and routing models described in Section 3 are 
used for evaluation. 

For comparison, we implement other three schemes, including SPEM ifT^ and NFMC [[3^ . 
We compare their performance with DependSHM. We consider two schemes for observing 
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Fig. 7. Performance of different fault detection methods; the fault detection accuracy. 


the performance of our fault detection and tolerance methods: i) distributed fault detection 
under localized data processing (DependSHM); ii) centralized fault detection under localized 
data processing (cSHM). SPEM is a WSN deployment method for SHM that nicely explains 
CS requirements and is verified on the GNTVT. It adjusts the quality of sensor locations to 
better fit WSN requirements. It is a centralized data processing method. We intend to verify its 
performance under fault detection and tolerance support and compare with DependSHM. NFMC 
is our preliminary fault detection and tolerance WSN-based SHM scheme, which is based on 
the natural frequency extraction and matching. 

Using simulation results, we compare DependSHM with them in several aspects under the 
fault injections;: i) fault detection accuracy; ii) dependability in terms of detection ability and 
mode shape (<I>) recovery, etc., and iii) energy cost of the WSN. Here, the detection ability is the 
rate that is calculated by the percentage of successful faulty sensor detection to the percentage of 
the amount of the sensor fault injection. This includes both the false positive and false negative 
occurrences. Here, false positive cases are recorded as an undamaged location of the structure 
is identified as a damaged location, and false negative cases are recorded as a damaged location 
of the structure is identified as an undamaged location. 

2) Results: In the first set of simulations, we implement all three schemes under the sensor 
fault injection (through modifying a number of sensors’ signals randomly in the data sets). 
A fraction of the sensor nodes is randomly selected and the modified faulty signals are fed 
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Fig. 8. Observation on the actual mode shape curvature vs. mode shape curvature in cases of faulty sensors and recovery (signal 
reconstruction). 


into their acquisition modules. We vary the number of faulty sensors from 15% to 25%. Each 
sensor node broadcasts its readings towards the neighboring sensor nodes. Each of the faulty 
readings is replaced by a random number independently drawn from a uniform distribution in 
the deployment field (0, 450). Eig. 6 shows Mil achieved by the four schemes. Out of them, 
DependSHM achieves the smallest value, followed by cSHM method. SPEM method performs 
poorly, since it requires centralized data processing and shows a significant amount of data 
packet loss during transmission. Due to heavy data losses, its performance on the Mil is low. 
Nevertheless, NEMC still outperforms SPEM in many sensor fault detection cases. 

Eig. 7 depicts the fault detection accuracy, which is computed as accuracy = (true positive 
+ true negative)/all. The detection accuracy in DependSHM is about 98%, which outperforms 
others. In SPEM, the detection accuracy is poorer (less than 80%) than that of others, while it 
is from 75% to 85% in NEMC. One major cause is that peak natural frequency signals used in 
NEMC and DependSHM achieve higher MIL However, we experience that the fault detection 
accuracy rate becomes lower in NEMC and SPEM than in DependSHM and cSHM, as the 
number of faulty sensor nodes in the WSN increases. 

Dependability Verification. We next observe the structural health condition using WSNs in 
simulations. We estimate mode shape (4)) curvature under sensor faults and the signal reconstruc¬ 
tion of the faulty sensors. We recover the first 4) (see Eig. 8) of the simulated structure with 100 
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Fig. 9. Energy cost in different measures analyzed by five rounds of monitoring. 


locations, which cover up to 450 meters of the strueture. $ is extraeted, based on sensor eolleeted 
signals in DependSHM. We ean see the impaet on the health status, in whieh the aetual mode 
shape is distorted under the sensor faults, whieh is sueeessfully reeovered by the eorresponding 
sensor signals’ reconstruetion. This implies that, if there is no appropriate faulty signal detection 
and tolerance methods, having sueeessful monitoring operations will be diffieult to aehieve. Thus, 
a WSN-based SHM system without having sueh methods will not be dependable. More results 
and analysis of the performance of WSN-based SHM system dependability in different schemes 
ean be found in Appendix F. 

Energy Cost. We next observe the energy eost in the first five rounds of monitoring seen in Fig. 
9 for DependSHM, eSHM, and SPEM schemes. We consider two cases: the amount of energy 
eost in the ease of normal monitoring operation when there is no fault injeetion in the WSN and 
the WSN needs to provide health monitoring; the amount of energy eost in oases of monitoring 
operations when there are the fault injection and recovery from the sensor faults through the 
signal reoonstruotion. We oaloulate the energy eost for oomputation, transmission, and overhead 
under both looalized and oentralized data prooessing. We did not oonsider the energy eost for 
measurement, as we oonsider the same amount of energy eost for the measurement in all the 
sohemes. We ean see that the amount of energy eost for oommunioation in SPEM and oSHM is 
very large oompared to SPEM. The amount of energy eost in oSHM is seen to be around 60% 
more than that of DependSHM, while it is 90% more than that of DependSHM. 








































28 



Fig. 10. Global mode shape (<1?) curvatures based on each sensor individual frequencies in SPEM and DependSHM. 

B. WSN Prototype System Implementation 

1) Methodology and Wireless Sensor Platform: We validate our seheme by implementing a 
proof-of-eoneept system using the TinyOS on Imote2 platforms WT\ . Our main objeetive is to 
verify i) the dependability and ii) the energy-effieieney of the system. We target the aeeuraey or 
sueeessful $ identifieation, beeause it ean provide us with the answer, whether or not a WSN- 
based SHM system is dependable in terms of various sensor faults. We employ 10 integrated 
Imote2s ealled SHM motes on a test strueture (refer to Fig. G1 and Appendix G1 for more 
detail); an additional Imote2 is loeated 15 meters away as the BS mote, and a PC as a eommand 
eenter for the BS mote and data visualization. The test strueture has 10 floors; at eaeh floor, a 
mote is deployed to monitor the strueture’s horizontal aeeelerations. In the experiment, Rmin is 
adjusted by the diameter of the strueture, whieh is adjusted by estimating the height of the test 
strueture and eaeh floor. 

Fault Injection. To produee a sizable vibration response of the test strueture, we eolleeted 
the original data by vertieally exeiting the test strueture using a magnetie shaker. We injeet 
the sensor faults in two oases: i) debonding fault between the 5th sensor and the strueture; ii) 
preoision degradation fault during aooeleration signal oapturing by the 10th sensor. The sensors 
are expeeted to work properly but exhibit faulty aeeeleration measurements or deeisions. 

2) Experiment Results: In the first set of experiments, we oompute mode shapes (<!)) in the 
base-line struotural system, when there are no damage events and no sensor faults. These are 
eomputed by sensor initial identified natural frequenoy (presented in detail in Appendix G2). Fig. 
10 demonstrates two mode shapes of the strueture, oaptured by using the identified frequenoies 
in both SPEM and DependSHM sehemes. The results oompare the exaet mode shapes obtained 









29 


by centralized WSN where the motes transmit their measured signals to the BS. On the other 
hand, in DependSHM, the final mode shapes provided by the each mote are combined at the 
BS. The errors between two processes are analyzed. The accuracy of mode shapes identification 
in SPEM is at least 13% lower than the accuracy in DependSHM. It is found that DependSHM 
has around 16% better accuracy than that of SPEM under topology 2 (as shown in Eig. 3). The 
global mode shape computed at the BS assembles all of the sensors’ final results. Note that in the 
mode shape assembling, mode shapes from different motes correspond to the slight difference 
in the set of natural frequencies. 

Eig. 11a shows experimental fault detection results. Remarkable changes in signals of the 5th 
and 10th sensors and some of their neighboring nodes are detected. The Mil changes in both 
of the sensor fault cases can be seen in Eig. 11b. Some of the neighboring nodes, e.g., 4th, 6th, 
9th, and so on have also provided an extent of change in their MIL This is because their signals 
have also been partially affected by the fault injection. The corrupted/faulty signals of a sensor 
(e.g., the 5th) are reconstructed (details performance analysis on the signal reconstruction can 
be found in Appendix G3). 

The energy cost analysis of the experimental WSN is provided in Appendices G4. We find 
that, in the case of faulty sensor detection and signal reconstruction, DependSHM consumes a 
small amount of energy in computation with a slight overhead, which is 5% to 8% of the total 
energy cost in each round. Meanwhile, it saves a significant amount of energy for communication 
(which is at least three times when compared to its counterparts). 

Dependability verification: Identification of what exactly happened in the structure. 
When computing the mode shapes, we should notice that the signals of the faulty sensors 
contribute to the global mode shape computation. Thus, the mode shape values corresponding to 
the failed sensors are changed drastically. If there is a missing sensor, the mode shape result is 
affected. Considering faulty signals, missing signals (in the case of sensor missing), or irregular 
signals (may be due to the damage), the mode shapes will be affected. But we need to know 
exactly what happened in a WSN-based SHM system so that we can realize whether WSN-based 
SHM is dependable or not. 

Now, we identify exactly what happened in the structure. Recall that there is a possibility 
of both sensor fault and damage occurrence at the same location. If there is a change in the 
signals with a single sensor only, the sensor may be faulty. If the change is present with multiple 
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Fig. 11. (a) Distributed sensor fault detection (5th sensor and 10th sensor are faulty); (b) Mil achieved by the two detection 
methods under the sensor faults. 


sensors, there is possibly damage. However, if there is damage, it cannot be identified before 
the faulty sensor detection. As the fault injected, the 5th sensor should be faulty at a different 
time. As shown in Fig. 12a, during the SHM operation, at =5, the 5th sensor is detected as 
faulty, and at Td=20, the 10th sensor is detected as faulty. The changes in the mode shape are 
computed at those time intervals. 

We inject structural physical damage through removing the plates on the 5th and 10th floors, 
since sensors located at these floors are faulty. They are not able to provide appropriate damaged 
information. We can see in Fig. 12b that the neighbors (4th, 6th, and 9th ) are able to detect an 
extent of changes (i.e., the presence of a damage) in the structure, where the slightly affected 
mode shapes clearly appeared. 

Under the same experiment and excitation setting, we further conduct experiments in which 
a faulty sensor signal is reconstructed by using our algorithm. The mode shape’s curvature is 
recovered significantly at the 10th sensor location, as shown in Fig. 12c. This means that there 
is possibly a damage, since the mode shape is still slightly affected. However, the changes in 
mode shape at the 5th sensor still remains and is slightly recovered at the neighbors. It provides 
the correctness of DependSHM and the dependability in WSN-based monitoring. If there was 
no recovery solution, the damage would not be identified and the changes at the sensor near the 
faulty sensor would not be discovered, which is the exact opposite of the 5th sensor cases. 

Further proof of the damage detection can be seen in Fig. 12d. Here, we replace the plate on 
the same floors. When the sensors wakeup and start monitoring, that time Trf=42. In Fig. 12d, no 
remarkable change appears at the 10th sensor location. Distorted mode shape information at the 
neighbor locations is completely recovered. We can say that no Mil appeared. In contrast, at the 
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Fig. 12. The mode shape’s curvature under sensor faults and recovery from the faults. 


5th location, mode shape remains unreeovered but there is no remarkable mode shape eurvatures 
at the neighbor sensors’ loeation. It proves that the 5th sensor is surely faulty, while there was 
damage at the 10th sensor at this period of monitoring (but whieh is not elearly deteeted as the 
prior damage). 

Through an analysis, the quality of the faulty sensor signal reeonstruetion is about 92% 
eompared to the base-line results under the fault-free eondition (as shown in Fig. 10, and TABLE 
G1 in Appendix G). The inferenee ean be drawn from the above analysis that, in the presenee 
of sensor faults, a damage ean be sueeessfully deteeted in DependSHM. 

VIII. Conclusion 

In this paper, we proposed a dependable WSN-based SHM seheme, DependSHM, by making 
the best use of resouree-eonstrained WSNs for SHM and ineorporating requirements of both 
engineering and eomputer seienee domains. DependSHM ineludes two eomplementary algo¬ 
rithms for sensor fault deteetion and faulty sensor’s signal reeonstruetion. It is able to provide 
the quality of SHM in the presenee of sensor faults automatieally, whieh does not need any 
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network maintenance for the fault detection and recovery, and does not consume significant WSN 
resources for the recovery. In the future, we plan to study decentralized computing architectures 
in WSNs, which can be integrated by the computing system issues and structural engineering 
system techniques in conjunction. Such an architecture is highly expected to reduce data traffic 
for data-intensive SHM and energy cost in WSNs. 
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Appendix A 

Extended Energy Cost Model (cosT(ei)) 

One of our important objectives is to minimize the energy cost of the network regarding various aspects, including 
the sensor fault detection and recovery, damage event detection. Let cost{ei) denote the total energy cost of sensor 
i, including measurement, computation, transmission, and overhead. The sensor i has discrete power level that it 
can adjust it in ranges from i?niin to i?max- In the beginning, sensor i adopts its minimum power level and then i 
may dynamically increase it. 

We describe here how energy consumed in transmitting a packet. The maximum energy cost of i depends on 
the routing protocol used by the data collection application. 

Consider a shortest path routing model ifOl . jlbl ; there is a path from sensor i to neighboring sensor node or 
the BS j: q = zg, zi... Zk- Sensor i propagates the data to them. We can find the zth hop sensor on each path and 
calculate the amount of traffic that passes along on the paths within each round of monitoring data collection (T^, 
d=l,2,..n). Then, the cost{ei) can be decomposed into the following four parts: 

0031 ( 62 ) — C-comp C-samp f^oh (^^3) 

We describe these terms in the following: 

• ct is the total energy cost for data transmission in a round of data transmission over a link between a 
transmitter and a receiver, where sensor i uses its power level from a minimum to a maximum, but not 
beyond the maximum power. We use a standard energy cost model for calculating the packet transmission 
cost 1481 . 

• Ccomp is the energy cost for processing data locally, e.g., computing equation (6) in Section 4.2. If a sensor is 
allowed to transmit the raw vibration data to the BS directly, Ccomp would be very low. The cost is mainly due 
to the onboard processor, such as a micro-controller, DSP chip, or FPGA 1491 . These devices consume energy 
proportional to the number of processing cycles, as well as the maximum processor frequency /, switching 
capacitance fi, and hardware specific constants k and [3, respectively l49l . We focus on the number of cycles 
taken for tasks, e.g., equation (6) and the amount of samples taken. The number of cycles required to perform 
a task on the amount of samples (denoted by w) are estimated according to the computational complexity 
0{w), which describes how many basic operations, i.e., averages, additions, multiplications, etc., must be 
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Fig. Bl. Based on acquired vibration signal characteristics, measured natural frequencies captured by sensor 1 and sensor 2 
under manual input excitation on the structure, respectively. This shows the structural system oscillation (moving back and forth) 
between its original state and its displaced state, captured by the two in their vicinity. 


performed in executing the task. Given these parameters, the computational energy to complete a task can be 
calculated according to: 

f 

Gcomp = 0{w) ■ + 13) (A24) 

• Csomp is the energy required for a sampling cost of M data points; when sensors capture vibration signals, 
assuming a maximum 50% overlapping, M = {nal‘2. + 1/2) • Cr, where Ua and Cr are the number of averages 
mainly for the purpose of noise reduction, that practically ranges from 10 to 20 and cross-correlation factor, 
respectively ||3?1 . B3l . We assume that no and Cr are set by fixed values on a sensor. 

• Coh is any additional overhead for some causes, e.g., fault detection and signal reconstruction, copying data 
to a local buffer, and network latency. 


Appendix B 

Method of Extracting Local Mode Shape 

In Section 4.2, we have described the state-space model for structural mode shape computation ($) at individual 
sensor. Here, we show a method to local $ extraction and explain benefits of utilizing the extracted $ over / 
towards sensor fault detection and tolerance. 

Definition Bl [Natural Frequency]. Every structure has a tendency to vibrate with much larger amplitude at some 
frequencies than others. Each such frequency is called a natural frequency denoted by f. f is an internal vibration 
signal characteristic of structure, and is different for different structures (such as from building to bridge, from 
indoor to outdoor). In other words, it is defined as the number of times that a structural system oscillates (moves 
back and forth) between its original state and its displaced state when assuming there is no outside interference. 

Definition B2 [Mode shape]. When subjected to external forces, the response of a structure is conceptually 
similar to the response of a vibrating string or structural components such as a metal plate. Upon excitation, the 
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Fig. B2. (a) The finite element model (FEM) of our designed physical infrastructure; (b) its original mode shape; (c)-(e) its 

three mode shapes: mode 1, mode 2, and mode 3. FEM is a computer based numerical model often used for calculating the 
behavior and strength of structural mechanics, such as vibration and displacement. 


vibrations are a combination of several harmonics (or at a specific frequency of vibrations), known as modes. Each 
mode deforms the structure into a particular spatio-temporal pattern known as a mode shape, denoted by $. 

A. Local Mode Shape by Each Sensor 

As the network modeled in Section 3.1, m sensors are available for deployment on a structure, and they extract 
a total of p mode shapes from the measurement of m sensors. The corresponding natural frequencies and mode 
shapes are represented, respectively, as follows; 


{=[f\f,...,n (Bi) 


(B2) 

where f^{k = is the natural frequency, is the mode shape corresponding to ' (f’ii'i- = 

1,... ,m) is the value of at the sensor. For example. Fig. Bl and Fig. B2 illustrate the first two sensors’ 
natural frequencies and corresponding mode shapes of a physical structure, receptively, which are extracted from 
measurements of 10 deployed sensors in our prototype system. In the experiment, vertical accelerations at all the 
given sensors are obtained, and 10% noise is added to all measurements. Under the artificial input excitation, the 
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measured accelerations (the peak frequency pointed by 1,2 ,...) at sensors 1 and sensor 2, respectively, refer to 
Fig. B1 and Fig. B2 (which are obtained by using network topology in Fig. 3). 

B. $ over f in Sensor Fault Detection 

The difference between / and $ can be observed by comparing (Cl) with (C2) and Fig. Cl with Fig. C2. 
According to the ACSM theory, / is not suitable characteristic for damage event detection due to several reasons; 

(i) / is not a sensitive indicator to damage event, where only severe damage event causes noticeable change on 
the set of /; 

(ii) Due to the global property, / does not contain any spatial information, and thus localizing damage event is 
difficult, while damage event detection using / is computationally inefficient; 

(iii) High frequency modes are more susceptible to additional noise than low frequency modes; iv) / is susceptible 
to additional noise ll43l ; To improve the usability of the / to detect damage event of small magnitude, high- 
frequency modes, which are associated with local responses, may be used. However, we argue that adopting 
/ is not suitable for WSNs considering WSNs’ resource limitation; 

(iv) Importantly, a large set of / is required to be sent to the BS (e.g., SPEM ifThl . NFMC OH); damage event 
detection is greatly affected if a portion of it is lost during transmission. 

(v) $ is directly linked to topology of the structure and T* highly features the dynamics of the structure. 

On the other hand, it can be seen from (C2) that $ has elements corresponding to each sensor, thus containing 
spatial information. $ and its derivatives have been proven to be very sensitive features to detect damage event. 
It takes into account out-of-frequency-bandwidth modes of the structure, and is also applicable to a complex 
linear structure. This is why we target on T* computation and observe the impact of sensor faults on $. However, 
theoretically, $ is a global parameter of a structure which means that, using sensor deployed on different locations 
of a structure, the same set of T* may not be obtained. To mitigate this problem, we allows each sensor estimate T* 
taking measurements about its vicinity (i.e., local structural response). 

In this paper, we utilize the mode shape curvature method proposed by civil engineering to identify significant 
change (i.e., damage event) in the mode shape Il50l . The mode shape curvature has high sensitivity to damage event. 

Appendix C 

The Reason of Neglecting Damping 

In Section 4.2, we have described the space-space model for the structural response measurements by sensors. 
In (6), we have considered the matrices of mass and stiffness coefficients of the various elements of the structure, 
but we have neglected the damping. 

Damping is neglected in the model (6), considering individual sensor measurement estimation. In any case, (a) 
faults in the sensors will only be identihed if they cause changes in the response of a greater magnitude than the 
errors in the estimated mode shape, and (b) the modes with low damping, having approximately real modes, will 
be strongly excited. Thus, undamped mode shapes can be accurately estimated by (6) at each sensor. 
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Fig. D. Graphical representation of the state-space equation based Kalman filter that is used in sensor’s faulty signal 
reconstruction. 


Appendix D 

The State-space-equation based KF for Signae Analysis 

In Section 6, we proposed the Kalman filter (KF) technique for signal reconstruction. In this Appendix, a 
graphical representation of the state-space equation based KF is presented in Fig. D. This equation is made with 
the help of the state-space representation of the structural system, as described in Section 6.1. In Fig. D, ut is 
the structural excitation at a specific frequency at time t. Mt and Kt are transition matrices. The signals crt_i 
and at represent the measurement noises, respectively. When measuring the responses of a dynamical structural 
system by wireless sensors, the actual signals produced by the sensors are contaminated by noises due to internal 
manufacturing defects, physical interference, or external environmental effects. According to features of the KF, we 
assume that every measurement from the wireless sensors contains noises; thus, if the noise measurement is zero, 
the KL collapses. Setting the mean of noise as zero is a common practice: E[at] = 0. Noises are assumed to be 
independent of each other, and are normally distributed with covariance matrices, c„ = [crcr^]. The underlying KF 
information is that KF is a recursive algorithm consisting of a loop, which is passed through for each time instant 
t. 


Appendix E 

Missing Sensor Detection Method 

In Section 5, we presented Algorithm 2 for faulty sensor detection. However, if a sensor is missing or is out of 
service during the monitoring operation, if the sensor cannot be reached because of communication constraint or 
failure, or there is an unknown reason, the algorithm cannot guarantee detection of such a sensor node. In order 
to detect these sensors, we apply a method of Kullback-Leibler divergence (KL) lISTIl between the measured and 
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Fig. E. Missing or failed node detection in a WSN-based system: (a) an example of Mil change in the sensor signals; (b) the 
detection result under the KL-KF. 


estimated sensor signals and update Kalman filter (KF) with KL, which can be used as a fault indicator 
KL-KF (Kullback-Leibler-Kalman-Filter) for such sensors. Note that for a faulty sensor signal reconstruction, we 
will use the Kalman filter (KL) technique. 

The symmetrized form of the KL between the probability distributions of one measured signal (t/i) at time t 
and with the KF estimated signal iji is as follows; 

= by. - P!/.]log2— (El) 

where py^ is the probabilities based on the number of points falling into the ith bin. When the KL between two 
probability distributions is zero, the signals are identically distributed. The fault indicator is defined as; 

^ Pmax 

^^ l<p< (E2) 

Pmax 4 

p=l 

If the faulty sensor p is not used for the estimation of the Pmax sensor signals, then the KL distance between 
the measured and estimated signals will be minimal; otherwise, the distance will be higher. This is shown under 
the network topology in Fig. 3a. It can be seen in Fig. D that without sensor 5, the best estimation is possible, 
which clearly indicates the sensor is faulty. This method based on KF is able to detect a missing or failed sensor. 
Also, pure bias faults with the Mil method are enhanced further by this KL-KF method. Thus, using KF-KL with 
the help of Algorithm 2, it can be guaranteed to detect the fault types that produce faulty readings. 

We illustrate the justification of sensor fault identification method based on Mil (i.e., w) through Algorithm 2. 
In our real experiment, under the manual random excitation and 5th sensor removal, the 5th sensor is detected as 
faulty. As shown in Fig. E(a), the relative change in Mil indicates the sensor 5 as faulty. Actually, the sensor was 
removed from the location, however, the sensor is detected as missing by as shown in Fig. E(b). To 

guarantee a certain redundancy of information in each sensor data set, the initial frequency should be available for 
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Fig. FI. Dependability verification: fault detection ability of different schemes. 


identifying the faulty vibration signal. Therefore, if one of the neighboring nodes is missing, the KL-KF divergence 
between the measured and estimated sensor signals can be used as a sensor fault indicator. 

Appendix F 

More results oe WSN-based SHM System Dependability 

In Section 7.1.2, we have partly performed an analysis of the system dependability. In the analysis, we have 
used a combination of true positive and true negative results in the sensor fault detection accuracy estimation. In 
this Appendix, we continue the analysis of the performance of the system dependability. We particularly consider 
the dependability of WSN-based SHM schemes as the ability of fault detection and the ability of structural health 
event (damage) detection of the schemes. 

At first, we discuss the detection ability of different WSN-based schemes. Fig. FI demonstrates the fault detection 
ability of DependSHM and other schemes. We can see that the detection ability of DependSHM is much better 
than that of cSHM, NFMC, and SPEM. NFMC shows higher detection errors than DependSHM, even higher than 
cSHM. Looking into details of causes, we summarize the following observations under the random fault injection: 

(i) The same pick frequencies cannot be achieved in many neighborhoods or clusters in NFMC; 

(ii) One or more clusters are disconnected from the network, as one or more faulty sensors are isolated based on 
the natural frequency comparison (although it shows the good ability rate of fault detection in some clusters); 

(iii) The scheme is limited to the frequency matching based fault detection; 

(iv) NFMC fails to detect other types of faults; 

(v) The fault detection ability of SPEM is very low, due to non-faulty reading losses that results in a increased 
amount of faulty readings; 

(vi) When attempting to recover from the faults, both SPEM and NEMC schemes require a significant amount of 
energy cost. 
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Fig. F2. Dependability verification: structural health event detection ability of different schemes. 


We next examine the system dependability in terms of the structural health event detection ability of a system. 
This can provide us an implication that how much a system can cope with sensor faults and what is the significance 
of addressing dependability issue in a system. We gather all the false positive and false negative cases appeared in 
the WSN-based SHM (achieved from a total of 50 simulation runs), and we get an average. Then, we calculate the 
structural health event detection ability rate as 1-(false positive rates + false negative rates). The results is depicted 
in Fig. F2. We also take into account the structural health monitoring under NO recovery (a preliminary analysis 
has been done based on these results, as illustrated in Fig. 8). Here, we intend to find evidence that what exactly 
happens when there is NO dependability option (fault detection and recovery) provided. 

In Fig. F2, we can see the results, which shows that the structural event detection ability of DependSHM is 
between 93% and 97.2%, which greatly outperforms others. In SPEM, the detection ability under recovery from 
sensor faults tolerance algorithm is inferior (between 75% and 92%) among all of the schemes, while it is between 
74% and 95% in NFMC and 87% to 95% in cSHM. There can be various reasons that SPEM provides poor 
detection rate, including i) centralized decision making on the fault detection and tolerance (data losses on the 
fly is a factor), ii) application-specific sensor deployment, iii) natural frequency matching problem, and so on. In 
NEMC, the peak natural frequency signals used in the sensor fault detection and recovery, by which the actual 
mode shape curvature slightly distorted. This lead to a lower Mil that results in a lower structural event detection 
ability. As it can be seen in Eig. E2, the structural event detection ability becomes lower in NEMC and SPEM than 
in DependSHM and cSHM, as the number of faulty sensor nodes in the WSN increases. 

Erom the results in Eig. E2, the structural event detection ability rate is around 65% in a system with NO recovery 
from sensor faults. It may make us surprised that the monitoring operations in a WSN-based SHM can be often 
meaningless if there is no dependability option provided. Erom a deep observation, we have found evidence that 
faulty sensors can corrupt results of a health event in a structural system without being detected. We have seen that 
measured signals introduced by some faulty sensors often identify its location as damaged (actually it is undamaged 
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location). We also have found that some faulty sensor identify its location as undamaged (actually the location is 
damaged). There are a large number of such wrong diagnoses (false positive and false negative) that lead to a 
reduced structural event detection ability. 


Appendix G 

More Details oe the WSN Prototype System Implementation 

A. Extended Detail of the Experimental Setup 

We validate DependSHM by implementing a proof-of-concept system using the TinyOS on Imote2 platforms 
im. Our main objective is to verify i) the dependability and ii) the energy-efficiency of the system. We target the 
accuracy or successful T* identification, because it can provide us with the answer, whether or not a WSN-based 
SHM system is dependable in terms of various sensor faults. 

The Imote2 (IPR2400) is an advanced wireless sensor platform (off-the-shelf), offering sufficient processing 
capability and communication resources to locally and continuously monitor vibration characteristics under intensive 
conditions. Its main board combines a low power PXA271 XScale processor with an 802.15.4 radio (CC2420) and 
an antenna using 2.4 GHz. The major limitation with it is the energy. 

We employ 10 integrated Imote2s called SHM motes on a test structure, as shown in Fig. Gl; an additional Imote2 
is located 15 meters away as the BS mote, and a PC as a command center for the BS mote and data visualization. 
The test structure has 10 floors; at each floor, a mote is deployed to monitor the structure’s horizontal accelerations. 
Each mote runs a program (implemented in the nesC language) to process the acceleration data acquired from 
on-board accelerometers. The BS receives the data packets from the sensors through wireless communication, and 
relays the data to the PC over a USB cable. The PC commands and sets parameters for the network through BS. 
Java and Matlab are used to calculate and visualize the whole structural health condition. In the experiment, Rmin 
is adjusted by the diameter of the structure, which is adjusted by estimating the height of the test structure and 
each floor. Imote2’s discrete levels of range are set to use Rmin and Rmax- 

B. Sensor Identified Natural Erequencies 

In Section 7.2.2, we have given experimental mode shapes, estimated based on natural frequencies. In the first set 
of experiments, we compute the natural frequencies, as shown in TABLE Gl. These frequencies are used in creating 
mode shapes ($) in the base-line structural system, when there are no damage events and no sensor faults. Note that 
such a base-line mode shape should be not fixed but should be dynamic, i.e., a WSN-based SHM system can be 
enabled to adapt or update its base-line mode shape, taking into account dynamic environments and environmental 
noise factors. We And that the Mil in different frequencies identified at different sensors is low (the result has been 
shown in Fig. 11). 
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(a) An integrated SHM mote 
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Fig. Gl. (a) The SFIM mote integrated by Imote2; (b) twelve-story test structure and the placement of 10 SHM motes on it; 
(c) their deployment. 


TABLE Gl 

IDENTIHED NATURAL EREQUENCIES BY THE FIRST FIVE SENSORS OF THE EXPERIMENTAL WSN IN SPEM AND 

DependSHM. 


Mode 

Frequencies (Hz) 

Centralized processing (SPEM) 

Localized processing (DependSHM) 


^1 

52 

53 

54 

■^5 

■?! 

52 

53 

54 

■Ss 

1 

13.211 

12.213 

14.131 

15.123 

13.435 

14.134 

13.141 

15.312 

16.856 

14.355 

2 

17.341 

14.798 

15.112 

16.234 

15.141 

17.741 

15.141 

17.214 

17.852 

16.641 

3 

20.834 

21.334 

19.134 

21.434 

19.746 

21.341 

22.932 

21.341 

22.344 

21.341 


C. Signal Reconstruction at a Faulty Sensor 

In Section 7.2.2, we have also provided the sensor fault detection results, where we have found that sensor 5 is 
faulty. In order to the support the results, we hereby can observe the faulty signal reconstruction of the 5th sensor, 
as shown in Fig. G2. The drift in the measured signal (red line) is corrected by the estimated signal (green lines). 
We observe the mutual independence under the fault injection at the 5th sensor. In DependSHM, when sensor nodes 
process data locally, the small value in the Mil is achieved, ranging from 2% to 4%, and they are not considered 
faulty. The Mil provides the best value, when there is a remarkable change in the sensor measured signals, i.e., the 
5th sensor and 10th sensor are faulty. This reveals that there can a better accuracy of fault detection in DependSHM 
in practice, compared to others. 
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Fig. G2. Signal reconstruction of the 5th sensor (that is detected faulty). 
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Fig. G3. The performance on the energy cost of the WSN in different schemes. 


D. Energy Cost (cost(ej)) 

Due to space limitation, we have not presented the performance of energy cost of the WSN in Section 7.2.2, 
which we present in this Appendix. 

We allow all of the sensors to sleep after each monitoring period to perform power management. The TinyOS 
2.0 drivers for the Imote2 supports putting all of the hardware to sleep when it is switched off. This is obvious 
for a WSN-based SHM system, since a WSN does not always need to run actively in case of specific structural 
event monitoring. For example, in case of aerospace vehicle monitoring, when it is not flying, the WSN may not 
need monitoring operations. In another case, the WSN can be scheduled to run periodically or a part of the sensors 
can be scheduled to wake up periodically and check health event status. cost{ei) is calculated by the energy cost 
for computation, transmission, measurement, and overhead, where the overhead statistics with current cost data is 
combined. The data sheet can be found in El- 

Fig. G3 shows the energy cost of a round of monitoring, T^^i. The DependSHM method significantly decreases 
the energy cost compared to SPEM, from 0.197 mAh to 0.072 mAh. The reason is that the major energy is consumed 
by the raw signal transmissions to the BS. The actual computation cost in DependSHM is 0.0072 mAh to execute 
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Fig. G4. The performance on the energy cost of the WSN in the first five round of monitoring in DependSHM and NFMC. 


the basic equations and fault detection and signal reconstitution. However, it fully depends on the number of cycles 
that a sensor CPU requires. It also varies from sensor to sensor based on the tasks a needs to do. A sensor does 
not need computation for signal reconstruction if there is not fault. In such a case, a sensor can save an average 
of 0.0027 mAh. More importantly, in DependSys, the computation saves the Imote2 an average of 0.165 mAh 
during transmission, since it reduces the time that the CC2420 radio is active. The overhead is caused by end-to- 
end transmission delay and writing/reading data to/from Imote2’s memory, since we depend on local processing. In 
both SPEM and NFMC methods, transmitting a large amount of raw data in each (i.e., transmission of natural 
frequency sets and frequent retransmissions caused by packet losses) increases costiei). However, NFMC achieves 
slightly lower energy cost for transmission than SPFM. 

Further performance analysis of cost{ei) in five rounds of monitoring (Td, d = 1, 5) can be seen in Fig. G4. 
This shows the actual amount of energy cost required in DependSHM. We can see that DependSHM outperforms 
NFMC significantly because of the above causes, cluster maintenance, and network maintenance (e.g., faulty sensor 
isolation), particularly the set of mode shapes transmitted from the cluster-head to the BS. This is because the final 
mode shapes of each cluster is transmitted by each cluster-head, while SPFM requires transmission of all natural 
frequency sets. In our distributed solution, there is no frequent retransmission and the final mode shapes transmitted 
by each sensor are without sensor fault information. In the case of faulty sensor detection and signal reconstruction, 
the system consumes a small amount of energy in computation with a slight overhead, which is 5% to 8% of the 
total energy cost in each round. 

In a concluding remark about the results we have found and presented in this paper, our proposed dependable, 
distributed SHM solution outperforms centralized solution almost in all aspects, including, energy cost of the WSN 
and offering monitoring system dependability. 
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